Abstract

In this paper we present a hybrid system composed of
a neural network based estimator and genetic
algorithms. It uses an unsupervised nonlinear Hebbian
neural algorithm to extract the principal components,
which are then used by the MUSIC frequency estimator
to extract the frequencies. We generalize this method
to avoid an interpolation preprocessing step and
improve its performance with a new stop criterion
that avoids overfitting. Furthermore, genetic
algorithms are used to optimize the initialization of
the neural network weights.

The experimental results are obtained by comparing our
methodology with other methods known in the literature
on a Cepheid star light curve.



1. Introduction 

Periodicity analysis of unevenly sampled data is a
relevant issue in several scientific fields. Classical
spectral analysis methods are unsatisfactory for this
problem. In this paper we present a neural network
based estimator system that performs well at frequency
extraction from unevenly sampled signals. It uses an
unsupervised nonlinear Hebbian neural algorithm to
extract the principal components of the signal
autocorrelation matrix, which are then used by the
MUSIC frequency estimator to extract the frequencies
[?]. We generalize this method to avoid an
interpolation preprocessing step, which generally adds
considerable noise to the signal, and we improve the
system performance with a new stop criterion that
avoids overfitting. Furthermore, genetic algorithms
are used to optimize the initialization of the neural
network weights, thereby improving convergence. The
experimental results are obtained by comparing our
methodology with other methods known in the
literature [?].



2. Evenly and unevenly sampled data 

In what follows, we assume x to be a physical variable
measured at discrete times $t_i$. The measurement
$x(t_i)$ can be written as the sum of the signal $x_s$
and random errors $R$:
$x_i = x(t_i) = x_s(t_i) + R(t_i)$. The problem we are
dealing with is how to estimate the fundamental
frequencies which may be present in the signal
$x_s(t_i)$ [?].

If x is measured at uniform time steps (even sampling)
[?], there are many tools based on Fourier analysis
that solve the problem effectively [7], [13]. These
methods, however, are usually unreliable for unevenly
sampled data. For instance, the typical approach of
resampling the data into an evenly sampled sequence
through interpolation strongly amplifies the noise,
which reduces the effectiveness of all Fourier based
techniques because they depend strongly on the noise
level [?].

To solve the problem of unevenly sampled data, we
consider two classes of spectral estimators:

Spectral estimators based on the Fourier Transform
(Least Squares methods);

Spectral estimators based on the eigenvalues and
eigenvectors of the covariance matrix (Maximum
Likelihood methods).

The Classic Periodogram [?], Lomb's Periodogram [?],
Scargle's Periodogram [17] and DCDFT [?] are the
methods of the first class that we use, while MUSIC
[?] and ESPRIT [15], [?] belong to the second class.
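As an illustration of the first class, the following is a minimal sketch of a Lomb-type periodogram applied to an unevenly sampled sinusoid. It uses the commonly quoted normalized form of the periodogram; the function name `lomb_periodogram`, the synthetic data and the frequency grid are assumptions made for this example and are not taken from the experiments reported here.

```python
import math
import random


def lomb_periodogram(times, values, freqs):
    """Normalized Lomb periodogram at strictly positive frequencies."""
    n = len(values)
    mean = sum(values) / n
    var = sum((v - mean) ** 2 for v in values) / (n - 1)
    power = []
    for f in freqs:
        w = 2.0 * math.pi * f
        # The time offset tau makes the sine and cosine terms orthogonal.
        tau = math.atan2(sum(math.sin(2.0 * w * t) for t in times),
                         sum(math.cos(2.0 * w * t) for t in times)) / (2.0 * w)
        cos_terms = [math.cos(w * (t - tau)) for t in times]
        sin_terms = [math.sin(w * (t - tau)) for t in times]
        xc = sum((v - mean) * c for v, c in zip(values, cos_terms))
        xs = sum((v - mean) * s for v, s in zip(values, sin_terms))
        cc = sum(c * c for c in cos_terms)
        ss = sum(s * s for s in sin_terms)
        power.append((xc * xc / cc + xs * xs / ss) / (2.0 * var))
    return power


# Synthetic, unevenly sampled sinusoid (illustrative data only).
rng = random.Random(0)
times = sorted(rng.uniform(0.0, 100.0) for _ in range(60))
true_freq = 0.1
values = [math.sin(2.0 * math.pi * true_freq * t) for t in times]

freqs = [0.005 * k for k in range(1, 101)]  # 0.005 ... 0.5 cycles per unit time
power = lomb_periodogram(times, values, freqs)
best_freq = freqs[power.index(max(power))]
print(best_freq)
```

The periodogram is evaluated directly at the measured times, so no resampling onto an even grid is needed.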

The methods based on the covariance matrix are more
recent and have great potential. For this reason, we
develop a method based on the MUSIC estimator and
compare it with the classical methods to assess its
performance.

3. The Neural Estimator 

In recent years several papers have dealt with
learning in PCA neural nets [?], identifying the
advantages, problems and difficulties of such neural
networks. In what follows, we use a robust
hierarchical learning algorithm because it has been
experimentally shown to perform best on our problem
[?].
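To make the idea of hierarchical Hebbian learning concrete, the sketch below implements Sanger's generalized Hebbian algorithm, a linear hierarchical PCA rule. It stands in for the robust nonlinear rule used here, whose update equations are not given in this text; the function name `gha_train`, the learning rate and the two-dimensional toy data are assumptions made for illustration.

```python
import math
import random


def gha_train(samples, n_components, lr=0.02, epochs=5, seed=0):
    """Sanger's rule: dw_i = lr * y_i * (x - sum_{k<=i} y_k * w_k)."""
    rng = random.Random(seed)
    dim = len(samples[0])
    w = [[rng.uniform(-0.1, 0.1) for _ in range(dim)]
         for _ in range(n_components)]
    for _ in range(epochs):
        for x in samples:
            y = [sum(wi * xi for wi, xi in zip(row, x)) for row in w]
            new_w = []
            for i, row in enumerate(w):
                # Hierarchical term: unit i removes what units 0..i reconstruct.
                recon = [sum(y[k] * w[k][j] for k in range(i + 1))
                         for j in range(dim)]
                new_w.append([row[j] + lr * y[i] * (x[j] - recon[j])
                              for j in range(dim)])
            w = new_w
    return w


# Toy data: large variance along one direction, small isotropic noise.
rng = random.Random(1)
direction = (math.cos(0.6), math.sin(0.6))
samples = []
for _ in range(500):
    a = rng.gauss(0.0, 1.0)
    samples.append([a * direction[0] + rng.gauss(0.0, 0.1),
                    a * direction[1] + rng.gauss(0.0, 0.1)])

weights = gha_train(samples, n_components=2)
first = weights[0]
norm = math.sqrt(sum(v * v for v in first))
alignment = abs(first[0] * direction[0] + first[1] * direction[1]) / norm
print(alignment)
```

The first weight vector should align, up to sign, with the direction of largest variance, which is the first principal component.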

Our neural estimator (ne) can be summarized as
follows:

1 Preprocessing: calculate and subtract the average
pattern to obtain a zero mean process with unit
variance. Interpolate the input data if required.

2 Train the neural network.

3 Calculate the frequency estimates by using the
MUSIC frequency estimator.
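The preprocessing in step 1 can be sketched as follows for a scalar series; for vector patterns the same operation would be applied component by component. The function name `standardize` and the sample values are hypothetical.

```python
import math


def standardize(values):
    """Subtract the mean and scale to unit variance."""
    n = len(values)
    mean = sum(values) / n
    std = math.sqrt(sum((v - mean) ** 2 for v in values) / n)
    return [(v - mean) / std for v in values]


samples = [1.2, 0.9, 1.4, 1.1, 0.7, 1.3]  # illustrative values only
z = standardize(samples)
z_mean = sum(z) / len(z)
z_var = sum(v * v for v in z) / len(z)
print(z_mean, z_var)
```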

MUSIC takes as input the columns of the neural network
weight matrix after learning. The estimated signal
frequencies are obtained as the peak locations of the
following function:

$$P_{MUSIC} = \frac{1}{M - \sum_{i} |e_f^H w(i)|^2}$$

where $w(i)$ is the i-th neural network weight vector
after learning, and $e_f$ is the vector

$$e_f = [e^{j 2\pi f t_0}, e^{j 2\pi f t_1}, \ldots, e^{j 2\pi f t_{(L-1)}}]^T$$

where $\{t_0, t_1, \ldots, t_{(L-1)}\}$ are the first L
components of the temporal coordinates of the uneven
signal [?].

When $f$ is the frequency of the i-th sinusoidal
component, $f = f_i$, we have $e_f = e_{f_i}$ and
$P_{MUSIC} \to \infty$. In practice a peak appears at
or near each component frequency, and the estimates
are taken from the highest peaks.
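The peak behaviour described above can be sketched numerically as follows. The code evaluates the MUSIC function on an uneven time grid under two assumptions that are not stated in this text: the constant M is taken equal to $e_f^H e_f$ (the length of $e_f$), as in classical MUSIC, and the trained weights are replaced by an orthonormal basis of the true signal subspace obtained by Gram-Schmidt. All function names are hypothetical.

```python
import cmath
import math


def steering_vector(f, times):
    """e_f: complex sinusoid of frequency f sampled at the uneven times."""
    return [cmath.exp(2j * math.pi * f * t) for t in times]


def music_pseudospectrum(freqs, times, weights, eps=1e-12):
    """Evaluate 1 / (M - sum_i |e_f^H w(i)|^2) on a grid of frequencies."""
    m = len(times)  # assumption: M = e_f^H e_f, the length of e_f
    spectrum = []
    for f in freqs:
        e = steering_vector(f, times)
        proj = sum(abs(sum(ek.conjugate() * wk for ek, wk in zip(e, w))) ** 2
                   for w in weights)
        spectrum.append(1.0 / max(m - proj, eps))  # clip to avoid dividing by 0
    return spectrum


def orthonormalize(vectors):
    """Gram-Schmidt; stands in here for the trained network weights."""
    basis = []
    for v in vectors:
        u = list(v)
        for b in basis:
            d = sum(x * y for x, y in zip(u, b))
            u = [x - d * y for x, y in zip(u, b)]
        norm = math.sqrt(sum(x * x for x in u))
        basis.append([x / norm for x in u])
    return basis


# Uneven sampling times (synthetic) and a single sinusoid at f0.
times = [k + 0.4 * math.sin(1.3 * k) for k in range(40)]
f0 = 0.1
weights = orthonormalize([
    [math.cos(2.0 * math.pi * f0 * t) for t in times],
    [math.sin(2.0 * math.pi * f0 * t) for t in times],
])

freqs = [0.005 * k for k in range(1, 101)]
spectrum = music_pseudospectrum(freqs, times, weights)
peak_freq = freqs[spectrum.index(max(spectrum))]
print(peak_freq)
```

With these stand-in weights the denominator vanishes at f0, so the clipped spectrum takes its largest value there.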

Furthermore, to optimize the performance of the PCA
neural networks, we stop the learning process when

$$\sum_{i} |e_f^H w(i)|^2 < M \quad \forall f,$$

thereby avoiding overfitting. In fact, keeping the
stop condition originally used in the ne causes it to
find periodicities that are not present in the signal,
whereas the new condition prevents this problem.
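A check of this stop condition on a finite grid of frequencies could look like the sketch below. The grid, the value passed for M and the example weight vectors are assumptions made for illustration; in the actual system the weights would come from the network during training.

```python
import cmath
import math


def projection_energy(f, times, weights):
    """sum_i |e_f^H w(i)|^2 for one frequency f."""
    e_conj = [cmath.exp(-2j * math.pi * f * t) for t in times]
    return sum(abs(sum(ec * wk for ec, wk in zip(e_conj, w))) ** 2
               for w in weights)


def stop_condition_met(times, weights, freqs, m):
    """True when the projection energy stays below m at every grid frequency."""
    return all(projection_energy(f, times, weights) < m for f in freqs)


times = [0.0, 0.9, 2.3, 3.1, 4.6, 5.2, 6.8, 7.7]  # uneven, illustrative
freqs = [0.01 * k for k in range(0, 51)]
m = len(times)  # assumed value of M for this example

small_weights = [[0.1] * len(times)]
large_weights = [[2.0] * len(times)]
print(stop_condition_met(times, small_weights, freqs, m))
print(stop_condition_met(times, large_weights, freqs, m))
```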

Since the weight initialization can sometimes lead to
local maxima of the objective function, we propose to
use genetic algorithms to choose it.

We use a real encoding of the net parameters [?] in a
string (chromosome), and then we run the genetic
algorithms with the system objective function as the
fitness function.
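A minimal sketch of a real-encoded genetic algorithm of this kind is given below. The operators (tournament selection, arithmetic crossover, Gaussian mutation, elitism), the parameter values and the toy objective are assumptions chosen for illustration; the operators and the objective function of the actual system are not specified in this text.

```python
import random


def genetic_search(fitness, dim, pop_size=20, generations=40, sigma=0.1, seed=0):
    """Maximize `fitness` over real-valued chromosomes of length `dim`."""
    rng = random.Random(seed)
    pop = [[rng.uniform(-1.0, 1.0) for _ in range(dim)]
           for _ in range(pop_size)]
    history = []
    for _ in range(generations):
        pop.sort(key=fitness, reverse=True)
        history.append(fitness(pop[0]))
        children = [pop[0][:]]  # elitism: the best chromosome always survives
        while len(children) < pop_size:
            # Binary tournaments pick the two parents.
            a = max(rng.sample(pop, 2), key=fitness)
            b = max(rng.sample(pop, 2), key=fitness)
            mix = rng.random()
            # Arithmetic crossover followed by Gaussian mutation.
            children.append([mix * x + (1.0 - mix) * y + rng.gauss(0.0, sigma)
                             for x, y in zip(a, b)])
        pop = children
    best = max(pop, key=fitness)
    history.append(fitness(best))
    return best, history


def toy_objective(w):
    # Rayleigh quotient of the covariance diag(3, 1); its maximum is 3.
    return (3.0 * w[0] ** 2 + w[1] ** 2) / (w[0] ** 2 + w[1] ** 2 + 1e-12)


best, history = genetic_search(toy_objective, dim=2)
print(best, history[-1])
```

The best chromosome would then serve as the initial weight vector of the network, so that learning starts from a point that already scores well on the objective.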



4. Experimental Results 

Many experiments on synthetic and real signals were
made [?]. In this paper we present the results
obtained with one specific real signal, which
highlights the main features of our problem.

The signal is the light curve of the Cepheid T Mon
[?]. The sequence was obtained with the BVRI
photometric technique, and the sampling was made from
April 1977 to December 1979. The light curve is
composed of 24 samples and has a period of 27.02 days,
as shown in figure 1. In this case, the parameters of
the ne are: N = 10, p = 2, α = 5, μ = 0.001. The
frequency interval is [0 (1/JD), 0.5 (1/JD)]. The
frequency estimated by the ne is 0.020 (1/JD) (see
figure 2). By using the genetic algorithms (gne) we
improve the system performance, reaching the true
frequency of 0.034 (1/JD) (see figure 3). Lomb's
Periodogram finds the correct frequency but generates
several spurious peaks (see figure 4). As for DCDFT,
the frequency interval must be changed because the
method is too sensitive to very low frequencies (the
new interval becomes [0.01 (1/JD), 0.5 (1/JD)]) (see
figure 5). Finally, ESPRIT fails completely, finding a
frequency of 0.23 (1/JD) (see figure 6).

In conclusion, only Lomb's Periodogram and our gne
agree with the correct periodicity, but the former
shows several spurious peaks.



5. Concluding Remarks 

In this paper we have illustrated an improved
technique based on PCA neural networks and MUSIC to
estimate the frequencies of unevenly sampled data. It
has been shown to obtain good results on real data
(here we used the Cepheid T Mon light curve) compared
with other well known methods. A further improvement
can be obtained by using filters to extract and
identify one frequency at a time when dealing with
multi-frequency signals. In [?], we used our system to
detect the Milankovic frequencies in a stratigraphic
record. In that case the best performance was found
when we extracted one frequency at a time and
eliminated it with FIR filtering.



Figure 1: Light curve of T Mon

Figure 2: ne estimate of the fundamental frequency of
T Mon

Figure 3: gne estimate of the fundamental frequency of
T Mon

Figure 4: Lomb's Periodogram estimate of the
fundamental frequency of T Mon

Figure 5: DCDFT estimate of the fundamental frequency
of T Mon

Figure 6: ESPRIT estimate of the fundamental frequency
of T Mon